Extracting author-specifi c expressions using random forest for use in the sociolinguistic analysis of political speeches

نویسنده

  • Takafumi Suzuki
چکیده

This study applies stylistic text classifi cation using random forest to extract author-specifi c expressions for use in the sociolinguistic analysis of political speeches. In the fi eld of politics, the style of political leaders’ speeches, as well as their content, has attracted growing attention in both English (Ahren, 2005) and Japanese (Azuma, 2006; Suzuki and Kageura, 2006). One of the main purposes of these studies is to investigate political leaders’ individual and political styles by analyzing their speech styles. A common problem of many of these studies and also many sociolinguistic studies is that the expressions that are analyzed are selected solely on the basis of the researcher’s interests or preferences, which can sometimes lead to contradictory interpretations. In other words, it is diffi cult to determine whether these kind of analyses have in fact correctly identifi ed political leaders’ individual speech styles and, on this basis, correctly characterised their individual and political styles. Another problem is that political leaders’ speech styles may also be characterised by the infrequent use of specifi c expressions, but this is rarely focused on.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Extracting speaker-specific functional expressions from political speeches using random forests in order to investigate speakers’ individual political styles

In this study we extracted speaker-specific functional expressions from political speeches using random forests in order to investigate speakers’ individual political styles. Along with methodological development, stylistics has expanded its scope into new areas of application such as authorship profiling and sentiment analysis in addition to conventional areas such as authorship attribution an...

متن کامل

(Un)Translatability of Persian Idiomatic Expressions to English in Political Discourse

The present study sought to investigate the extent to which Persian idiomatic expressions would influence the western translators' strategies in providing the ultimate product in English, and it also attempted to uncover the underlying assumptions in target text, then to suggest some weighty strategies to overcome difficulties with translation. For this purpose, the data was analyzed within the...

متن کامل

Author gender identification from text using Bayesian Random Forest

Nowadays high usage of users from virtual environments and their connection via social networks like Facebook, Instagram, and Twitter shows the necessity of finding out shared subjects in this environment more than before. There are several applications that benefit from reliable methods for inferring age and gender of users in social media. Such applications exist across a wide area of fields,...

متن کامل

A Critical Study of Selected Political Elites' Discourse in English

This study explored how political elites can contribute to power enactment through using language. It started with a theoretical overview of Critical Discourse Analysis (CDA), and then presented a corpus consisting of speeches of eight political elites, namely, Malcolm X, Noam Chomsky, Martin Luther King, Josef Stalin, Vladimir Lenin, Winston Churchill, J.F. Kennedy and Adolph Hitler. This stud...

متن کامل

IGGSA Shared Tasks on German Sentiment Analysis (GESTALT)

We present the German Sentiment Analysis Shared Task (GESTALT) which consists of two main tasks: Source, Subjective Expression and Target Extraction from Political Speeches (STEPS) and Subjective Phrase and Aspect Extraction from Product Reviews (StAR). Both tasks focused on fine-grained sentiment analysis, extracting aspects and targets with their associated subjective expressions in the Germa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009